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^ ^cP SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANTS: Ji , Hongjun 

Rosen, Craig A, 

(ii) TITLE OF INVENTION: Breast Cancer Specific Gene 1 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L.L.C. 

(B) STREET: 1100 New York Ave., Suite 600 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

{A) MEDIUM TYPE: Floppy disk 
{B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS- DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/017,715 

(B) FILING DATE: 1998-FEB-03 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/673,284 

(B) FILING DATE: 28-JUN-96 

( C ) CLASS I FICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/000,602 

(B) FILING DATE: 30-JUN-95 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/08295 

(B) FILING DATE: 30-JUN-95 

( C ) CLASSI FICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/037,080 

(B) FILING DATE:- 03-FEB-97 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMJ\TION : 

(A) NAME: Steffe, Eric K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 1488.0810003 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO : 1 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12 . . 392 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CACGAGCCAC C ATG GAT GTT TTC AAG AAG GGC TTC TCC ATC GCC AAG AAG 50 
Met Asp Val Phe Lys Lys Gly Phe Ser lie Ala Lys Lys 
1 5 10 

GGC GTG GTG GGT GCG GTG GAA AAG ACC AAG CAG GGG GTG ACG GAA GCA . 98 
Gly Val Val Gly Ala Val Glu Lys Thr Lys Gin Gly Val Thr Glu Ala 
15 20 25 

GCT GAG AAG ACC AAG GAG GGG GTC ATG TAT GTG GGA GCC AAG ACC AAG 14 6 

Ala Glu Lys Thr Lys Glu Gly Val Met Tyr Val Gly Ala Lys Thr Lys 
30 35 40 45 

GAG AAT GTT GTA CAG AGC GTG ACC TCA GTG GCC GAG AAG ACC AAG GAG 194 
Glu Asn Val Val Gin Ser Val Thr Ser Val Ala Glu Lys Thr Lys Glu 
50 55 60 

CAG GCC AAC GCC GTG AGC AAG GCT GTG GTG AGC AGC GTC AAC ACT GTG 24 2 

Gin Ala Asn Ala Val Ser Lys Ala Val Val Ser Ser Val Asn Thr Val 
65 70 75 

GCC ACC AAG ACC GTG GAG GAG GCG GAG AAC ATC GCG GTC ACC TCC GGG 2 90 

Ala Thr Lys Thr Val Glu Glu Ala Glu Asn lie Ala Val Thr Ser Gly 
80 85 90 

GTG GTG CGC AAG GAG GAC TTG AGG CCA TCT GCC CCC CAA CAG GAG GGT 338 
Val Val Arg Lys Glu Asp Leu Arg Pro Ser Ala 'Pro Gin Gin Glu Gly 
95 100 105 

GAG GCA TCC A7^ GAG AAA* GAG GAA GTG GCA GAG GAG GCC CAG AGT GGG 38 6 

Glu Ala Ser Lys Glu Lys Glu Glu Val Ala Glu Glu Ala Gin Ser Gly 
110 115 120 125 

GGA GAC TAGAGGGCTA CAGGCCAGCG TGGATGACCT GAAGAGCGCT CCTCTGCCTT 4 42 
Gly Asp 

GGACACCATC CCCTCCTAGC ACAAGGAGTG CCCGCCTTGA GTGACATGCG GGTGCCCACG 502 

CTCCTGCCCT CGTCTCCCTG GACACCCTTG GCCTGTCCAC CTGTGCTG 550 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Val Phe Lys Lys Gly Phe Ser lie Ala Lys Lys Gly Val Val 
15 10 15 

Gly Ala Val Glu Lys Thr Lys Gin Gly Val Thr Glu Ala Ala Glu Lys 
20 25 30 

Thr Lys Glu Gly Val Met Tyr Val Gly Ala Lys Thr Lys Glu Asn Val 
35 40 45 

Val Gin Ser Val Thr Ser Val Ala Glu Lys Thr Lys Glu Gin Ala Asn 
50 55 60 

Ala Val Ser Lys Ala Val Val Ser Ser Val Asn Thr Val Ala Thr Lys 
65 70 75 80 

Thr Val Glu Glu Ala Glu Asn lie Ala Val Thr Ser Gly Val Val Arg 
85 90 - 95 

Lys Glu Asp Leu Arg Pro Ser Ala Pro Gin Gin Glu Gly Glu Ala Ser 
100 105 110 

Lys Glu Lys Glu Glu Val Ala Glu Glu Ala Gin Ser Gly Gly Asp 
115 120 125 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGGATCCAT GTTTTCAAGA AGG 2 3 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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GGAAGCTTCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGGGATCCCG ATGTTTTCAA GAAGG 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGGGTACCCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GGGGATCCGC CACCATGTTT TCAAGAAGG 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GGGGATCCTC AGAAAGCGTA GTCTGGGACG TCGTATGGGT ACTAGTCTCC CCCACTCTGG 



60 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGGTACCTC ACTAGTCTCC CCCACTCTGG 30 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGTACCTAAG TGAGTAGGGC GTCCGATCGA CGGACGCCTT TTTTTTGAAT TCGTAATCAT 60 

GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 120 

CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG 180 

CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA 24 0 

TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 300 

CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 3 60 

TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC 4 20 

AGC7\AAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 4 80 

CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 54 0 

TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 600 

TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 6 60 
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GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 720 

ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 780 

ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 84 0 

CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 900 

GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 9 60 

GTAGCTCTTG ATCCGGCAAA CA7VACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 1020 

AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT 108 0 

CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCGTCGA 114 0 

CAATTCGCGC GCGAAGGCGA AGCGGCATGC ATTTACGTTG ACACCATCGA ATGGTGCAAA 12 00 

ACCTTTCGCG GTATGGCATG ATAGCGCCCG GAAGAGAGTC AATTCAGGGT GGTGAATGTG 12 60 

AAACCAGTT^ CGTTATACGA TGTCGCAGAG TATGCCGGTG TCTCTTATCA GACCGTTTCC 1320 

CGCGTGGTGA ACCAGGCCAG CCACGTTTCT GCGAAAACGC GGGAAAAAGT GGAAGCGGCG 138 0 

ATGGCGGAGC TGAATTACAT TCCCAACCGC GTGGCACAAC AACTGGCGGG CAAACAGTCG 14 4 0 

TTGCTGATTG GCGTTGCCAC CTCCAGTCTG GCCCTGCACG CGCCGTCGCA AATTGTCGCG 1500 

GCGATTAAAT CTCGCGCCGA TCAACTGGGT GCCAGCGTGG TGGTGTCGAT GGTAGAACGA 15 60 

AGCGGCGTCG AAGCCTGTAA AGCGGCGGTG CACAATCTTC TCGCGCAACG CGTCAGTGGG 1620 

CTGATCATTA ACTATCCGCT GGATGACCAG GATGCCATTG CTGTGGAAGC TGCCTGCACT 168 0 

AATGTTCCGG CGTTATTTCT TGATGTCTCT GACCAGACAC CCATCAACAG TATTATTTTC 17 4 0 

TCCCATGAAG ACGGTACGCG ACTGGGCGTG GAGCATCTGG TCGCATTGGG TCACCAGCAA 18 00 

ATCGCGCTGT TAGCGGGCCC ATTAAGTTCT GTCTCGGCGC GTCTGCGTCT GGCTGGCTGG 18 60 

CATAAATATC TCACTCGCAA TCAAATTCAG CCGATAGCGG AACGGGAAGG CGACTGGAGT 1920 

GCCATGTCCG GTTTTCAACA AACCATGCAA ATGCTGAATG AGGGCATCGT TCCCACTGCG 198 0 

ATGCTGGTTG CCT^ACGATCA GATGGCGCTG GGCGCAATGC GCGCCATTAC CGAGTCCGGG 2 04 0 

CTGCGCGTTG GTGCGGATAT CTCGGTAGTG GGATACGACG ATACCGAAGA CAGCTCATGT 2100 

TATATCCCGC CGTTAACCAC CATCAAACAG GATTTTCGCC TGCTGGGGCA AACCAGCGTG 2160 

GACCGCTTGC TGCAACTCTC TCAGGGCCAG GCGGTGAAGG GCAATCAGCT GTTGCCCGTC 2220 

TCACTGGTGA AAAGAAA7\AC CACCCTGGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG 22 8 0 

TTGGCCGATT GATTAATGCA GCTGGCACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA 2340 

GCGCAACGCA ATTAATGTAA GTTAGCGCGA ATTGTCGACC AAAGCGGCCA TCGTGCCTCC 2 4 00 

CCACTCCTGC AGTTCGGGGG CATGGATGCG CGGATAGCCG CTGCTGGTTT CCTGGATGCC 2 4 60 

GACGGATTTG CACTGCCGGT AGAACTCCGC GAGGTCGTCC AGCCTCAGGC AGCAGCTGAA 2 52 0 



A 
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CCAACTCGCG AGGGGATCGA GCCCGGGGTG GGCGAAGT^C TCCAGCATGA GATCCCCGCG 2 580 

CTGGAGGATC ATCCAGCCGG CGTCCCGGAA AACGATTCCG 7\AGCCCAACC TTTCATAGAA 2 64 0 

GGCGGCGGTG GAATCGAAAT CTCGTGATGG CAGGTTGGGC GTCGCTTGGT CGGTCATTTC 2700 

GAACCCCAGA GTCCCGCTCA GAAGAACTCG TCAAGAAGGC GATAGAAGGC GATGCGCTGC 27 60 

GAATCGGGAG CGGCGATACC GTAAAGCACG AGGAAGCGGT CAGCCCATTC GCCGCCAAGC 282 0 

TCTTCAGCAA TATCACGGGT AGCCAACGCT ATGTCCTGAT AGCGGTCCGC CACACCCAGC 2 880 

CGGCCACAGT CGATGAATCC AGAAAAGCGG CCATTTTCCA CCATGATATT CGGCAAGCAG 2 94 0 

GCATCGCCAT GGGTCACGAC GAGATCCTCG CCGTCGGGCA TGCGCGCCTT GAGCCTGGCG 3000 

AACAGTTCGG CTGGCGCGAG CCCCTGATGC TCTTCGTCCA GATCATCCTG ATCGACAAGA 30 60 

CCGGCTTCCA TCCGAGTACG TGCTCGCTCG ATGCGATGTT TCGCTTGGTG GTCGAATGGG 3120 

CAGGTAGCCG GATCAAGCGT ATGCAGCCGC CGCATTGCAT CAGCCATGAT GGATACTTTC 3180 

TCGGCAGGAG CAAGGTGAGA TGACAGGAGA TCCTGCCCCG GCACTTCGCC CAATAGCAGC 32 4 0 

CAGTCCCTTC CCGCTTCAGT GACAACGTCG AGCACAGCTG CGCAAGGAAC GCCCGTCGTG 3300 

GCCAGCCACG ATAGCCGCGC TGCCTCGTCC TGCAGTTCAT TCAGGGCACC GGACAGGTCG 33 60 

GTCTTGACAA AAAGAACCGG GCGCCCCTGC GCTGACAGCC GGAACACGGC GGCATCAGAG 3 4 20 

CAGCCGATTG TCTGTTGTGC CCAGTCATAG CCGAATAGCC TCTCCACCCA AGCGGCCGGA 34 8 0 

GAACCTGCGT GCAATCCATC TTGTTCAATC ATGCGAT^CG ATCCTCATCC TGTCTCTTGA 35 4 0 

TCAGATCTTG ATCCCCTGCG CCATCAGATC CTTGGCGGCA AGAAAGCCAT CCAGTTTACT 3 600 

TTGCAGGGCT TCCCAACCTT ACCAGAGGGC GCCCCAGCTG GCAATTCCGG TTCGCTTGCT 3 660 

GTCCATAAAA CCGCCCAGTC TAGCTATCGC CATGTAAGCC CACTGCAAGC TACCTGCTTT 37 20 

CTCTTTGCGC TTGCGTTTTC CCTTGTCCAG ATAGCCCAGT AGCTGACATT CATCCGGGGT 37 8 0 

CAGCACCGTT TCTGCGGACT GGCTTTCTAC GTGTTCCGCT TCCTTTAGCA GCCCTTGCGC 38 4 0 

CCTGAGTGCT TGCGGCAGCG TGAAGCTTAA AA/^CTGCT^A AAAATAGTTT GACTTGTGAG 3900 

CGGATAACAA TTAAGATGTA CCCAATTGTG AGCGGATAAC AATTTCACAC ATTAAAGAGG 3960 

AGAAATTACA TATG 3 97 4 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 
-(B) TY-PE:: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA ^ 



A 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 



AAGCTTAAAA AACTGCAAAA AATAGTTTGA CTTGTGAGCG GATAACAATT AAGATGTACC 



60 



CAATTGTGAG CGGATAACAA TTTCACACAT TAAAGAGGAG 7\AATTACATA TG 



112 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCCGCTGCGG CAGACTCGAG CCAGCTCAAG CCCGCAGCTC GCAGGGAGAT CCAGCTCCGT 60 

CCTGCCTGCA GCAGCCAACC CTGCACACCC ACCATGGATG TTTCAAGAAG GGCTTCTCCA 12 0 

TCGCCAAGGA GGGCGTGGTG GGTGCGGTGG AAAAGACCAA GCAGGGGGTG ACGGAAGCAG 180 

CTGAGAAGAC CAAGGAGGGG GTCATGTATG TGGGAGCCAA GACCAAGGAG AATGTTGTAT 24 0 

GTACAGAGCG TGACCTCAGT GGCCGAGAAG ACCAAGGAGC AGGCCAACGC CGTGAGCAAG 300 

GCTGTGGTGA GCAGCGTCAA CACTTGGCCA CCAAGACCGT GAGGAGGCGG AG7VACATCGC 3 60 

GGTCACTCCG GGTGTGCGCA AGGAGGATTA GGCCATTCCC CCCAACAGGA GGGTGAGGCA 4 20 

TCAAGAAGAG AAGGGCAGGC AGAGTGGGGG AGACTAGAGG GCTACAGGCC AGCTTGGATG 4 80 

ACCTG7\AGAG CGCTCCTCTG CCTTGGGACA CCATCCCCTC CTAGCACAAG GAGTGCCCGC 54 0 

TTTGAGTGGA CATGCGGCTG TCCCACGTTC CTGCCCTCGT TTTCCCTGGG CCACCTTGGC 600 

CTGTCCAACT GTGCTGTTGC AACCAACTTA ATTGCCTTCC TTGGGCCCCA ACCAACTTTT 6 60 

TGGTTCTTTT TGACCCATTT ATGTTTGTTG TGAATTTTTT TTTT7\7^AAGA TTTCAAATAA 720 
AATTTGGGCC CATTTTTTAA AAAAAAAAAA AAAAA 755 



# 
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SEQUENCE LISTING 



SI 

I™: 

y I 



us 



(1) GENERAL INF*S)RMATION : 

(i) APPLICANTS: Ji, Hongjun 

Rosen, Craig A. 

(ii) TITLE OF INDENTION: Breast Cancer Specific Gene 1 

(iii) NUMBER OF SEQUENCES: 11 

( i V ) CORRES PONDENCeX ADDRESS : 

(A) ADDRESSEE :\ Sterne, Kessler, Goldstein & Fox P.L.L.C. 

(B) STREET: llftO New York Ave., Suite 600 

(C) CITY: Washi^^gton 

(D) STATE: DC 

(E) COUNTRY: USA^ 

(F) ZIP: 20005-35^4 

(v) COMPUTER READABLE FtoRM: 

(A) MEDIUM TYPE: Flippy disk 

(B) COMPUTER: IBM PG compatible 

(C) OPEEIATING SYSTEm\ PC-DOS /MS-DOS 

(D) SOFTWARE: PatentlTi Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DAI 

(A) APPLICATION NUMBE:i\ 

(B) FILING DATE: HERsfy 

(C) CLASSIFI CAT I ON : 



(vii) PRIOR APPLICATION D? 

(A) APPLICATION NUMBEk: U^ 08> 

(B) FILING DATE: 23-jilN-9^ 

(C) classification! 



BE ASSIGNED 



i, 284 



(vii) PRIOR application DAI 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 30-JUN-95 
. (C) CLASSIFICATION: 



6®/000, 602 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US^ 

(B) FILING DATE: 30-JUN-95 

( C ) CLAS S I FI CAT I ON : 



i/08295 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/037,\080 

(B) FILING DATE: 03-FEB-97 

( C ) CLAS S I FI CAT ION : 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Steffe, Eric K. 

(B) REGISTRATION NUMBER: 36,688 

(C) REFERENCE/DOCKET NUMBER: 14 88. 081C 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



)03 



(2) INFORMATION FOR SEQ ID N0:1: 



1=1 
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(ii) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Asp Va\ Phe Lys Lys Gly Phe Ser lie Ala Lys 
1X5 10 

Gly Ala Val Glu Lys Thr Lys Gin Gly Val Thr Glu 
ZO 25 

Thr Lys Glu Gly Wal Met Tyr Val Gly Ala Lys Thr 
35 \ 40 

Val Gin Ser Val Thr Ser Val Ala Glu Lys Thr Lys 
50 \ 55 60 

Ala Val Ser Lys Ala Val Val Ser Ser Val Asn Thr 
65 \70 75 

Thr Val Glu Glu Ala Gl\a Asn He Ala Val Thr Ser 
85 \ 90 

Lys Glu Asp Leu Arg Pro Ser Ala Pro Gin Gin Glu 
100 \ 105 

Lys Glu Lys Glu Glu Val AlA Glu Glu Ala Gin Ser 
115 \l20 

(2) INFORMATION FOR SEQ ID NQ: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base Aairs 

(B) TYPE: nucleic aci\ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



Lys Gly Val Val 
15 

Ala Ala Glu Lys 
30 

Lys Glu Asn Val 
45 

Glu Gin Ala Asn 



Val Ala Thr Lys 
80 

Gly Val Val Arg 
95 

Gly Glu Ala Ser 
110 

Gly Gly Asp 
125 



(xi) SEQUENCE DESCRIPTION: SEQ IIXN0:3: 
GGGGATCCAT GTTTTCAAGA AGG 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGAAGCTTCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO : 5 : 



27 



# 
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(iX SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
\{B) TYPE: nucleic acid 

STRANDEDNESS : single 
{t\) TOPOLOGY: linear 

(ii) MOLEJsULE TYPE: cDNA 



(xi) SEQUENCE I^ESCRIPTION : SEQ ID NO: 5: 
GGGGATCCCG ATGTTTTCAA GAAGG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27\base pairs 

(B) TYPE: nucleVc acid 

(C) STRANDEDNESSy: single 

(D) TOPOLOGY: liXear 

(ii) MOLECULE TYPE: cDnA 



25 



G 

w 



(xi) SEQUENCE DESCRIPTION: 
GGGGTACCCT AGTCTCCCCC ACTCTGG 
(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



i:Q ID NO: 6: 



27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GGGGATCCGC CACCATGTTT TCAAGAAGG 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



29 



(ii) MOLECULE TYPE: cDNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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GGGGOTCCTC AGAAAGCGTA GTCTGGGACG TCGTATGGGT ACTAGTCTCC CCCACTCTGG 



60 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) \ LENGTH: 30 base pairs 

(B) \rYPE: nucleic acid 

(C) S^RANDEDNESS : single 

(D) TOTOLOGY: linear 

(ii) MOLECULE ^YPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGGTACCTC ACTAGTCTCC CCSCACTCTGG 

(2) INFORMATION FOR SEQ IE) NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3974 l:^se pairs 

(B) TYPE: nucleic aS^idj 

(C) STRANDEDNESS: bd^h/ 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



30 



m 
y 



(xi) SEQUENCE DESCRIPTION 

GGTACCTAAG TGAGTAGGGC GTCCGAT 

GGTCATAGCT GTTTCCTGTG TGAAAT. 

CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG 

CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG 

TCGGCCAACG CGCGGGGAGA GGCGGTTTGC 

CTGACTCGCT GCGCTCGGTC GTTCGGCTGC 

TAATACGGTT ATCCACAGAA TCAGGGGATA 

AGCAAAAGGC CAGGAACCGT AAAAAGGCCG 

CCCCTGACGA GCATCACAAA AATCGACGCT 

TATAAAGATA CCAGGCGTTT CCCCCTGGAA 

TGCCGCTTAC CGGATACCTG TCCGCCTTTC 

GCTCACGCTG TAGGTATCTC AGTTCGGTGT 

ACGAACCCCC CGTTCAGCCC GACCGCTGCG 

ACCCGGTAAG ACACGACTTA TCGCCACTGG 



SEQ ID\NO:10: 
GA CGGACOCCTT 
GTT ATCCGCtVaC 
CCTAATGAOT 
GAAACCTGTC 
GTATTGGGCG 
GGCGAGCGGT 
ACGCAGGAAA 
CGTTGCTGGC 
CAAGTCAGAG 
GCTCCCTCGT 
TCCCTTCGGG 
AGGTCGTTCG 
CCTTATCCGG 
CAGCAGCCAC 



TTTTTTGAAT 

aattccacac 
gagctaactc 
gtgccagctg 
tcttccgct 
atvcagctcac 
ga^atgtga 
gtttVtccat 

AAAC 
GCGCTCTCCT 
AAGCGTGGCG 
CTCCAAGCTSIG 
TAACTATCGI 
TGGTAACAGG 



GTGGC5 



TCGTAATCAT 
AACATACGAG 
ACATTAATTG 
CATTAATGAA 
TCCTCGCTCA 
TCAAAGGCGG 
GCAAAAGGCC 
AGGCTCCGCC 
CCGACAGGAC 
GTTCCGACCC 
CTTTCTCATA 
GGCTGTGTGC 
CTTGAGTCCA 
.TTAGCAGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



# 



o 



CGJ^GGTATGT 
GAAG)^CAGT 
GTAGCTCTTG 
AGCAGATTAC 
CTGACGCTCA^ 
CAATTCGCGC 
ACCTTTCGCG 
AAACCAGTAA 
CGCGTGGTGA 
ATGGCGGAGC 
TTGCTGATTG 
GCGATTAAAT 
AGCGGCGTCG 
CTGATCATTA 
AATGTTCCGG 
TCCCATGAAG 
ATCGCGCTGT 
CATAAATATC 
GCCATGTCCG 
ATGCTGGTTG 
CTGCGCGTTG 
TATATCCCGC 
GACCGCTTGC 
TCACTGGTGA 
TTGGCCGATT 
GCGCAACGCA 
CCACTCCTGC 
GACGGATTTG 
CCAACTCGCG 
CTGGAGGATC 
GGCGGCGGTG 
GAACCCCAGA 



GTAJ 



AGGCGGTGCT 
ATTTGGTATC 

atccggcaaa 
gcgcagaaaa 
gtggt^cgaa 
;gaaggcga 

GGCATG 
CGTTATACGA 
ACCAGGCSCAG 
TGAATTAC^T 
GCGTTGCCAC 
CTCGCGCCGA 
AAGCCTGTAA 
ACTATCCGCT 
CGTTATTTCT 
ACGGTACGCG 
TAGCGGGCCC 
TCACTCGCAA 
GTTTTCAACA 
CCAACGATCA 
GTGCGGATAT 
CGTTAACCAC 
TGCAACTCTC 
AAAGAAAAAC 
CATTAATGCA 
ATTAATGTAA 
AGTTCGGGGG 
CACTGCCGGT 
AGGGGATCGA 
ATCCAGCCGG 
GAATCGAAAT 
GTCCCGCTCA 



\CTCT 



ACAGAGTTCT 
TGCGCTCTGC 
CTUyVCCACCG 
AAAGGATCTC 
AACTCACGTT 
AGCGGCATGC 
ATAGCGCCCG 
TGTCGCAGAG 
CCACGTTTCT 
TCCCAACCGC 

ctccagtctg 
:aactgggt 
agcsggcggtg 

GGATGACCAG 
TGATG' 
ACTGGGCfeTG 
ATTAAGTTi 
TCAAATTCAG^ 
AACCATGCAA 
GATGGCGCTG 
CTCGGTAGTG 
CATCAAACAG 
TCAGGGCCAG 
CACCCTGGCG 
GCTGGCACGA 
GTTAGCGCGA 
CATGGATGCG 
AGAACTCCGC 
GCCCGGGGTG 
CGTCCCGGAA 
CTCGTGATGG 
GAAGAACTCG 



-57- 

TGAAGTGGTG 
TGAAGCCAGT 
CTGGTAGCGG 
AAGAAGATCC 
AAGGGATTTT 
ATTTACGTTG 

gaagagagtc 
tatgccggtg 
gcgaaaacgc 
gtggcacaac 
gccctgcacg 
gccagcgtgg 
cacaatcttc 
gatgccattg 
gaccagacac 
gagcatctgg 
gtctcggcgc 
ccgatagcgg 
^gctgaatg 
ggVgcaatgc 
ggat^cgacg 
gatttVcgcc 
gcggtgaagg 

CCCAATAcVc 

CAGGTTTCC* 

ATTGTCGACC 

CGGATAGCCG 

GAGGTCGTCC 

GGCGT^GAAC 

AACGATTCCG 

CAGGTTGGGC 

TCAAGAAGGC 



GCCTAACTAC 
TACCTTCGGA 
TGGTTTTTTT 
TTTGATCTTT 
GGTCATGAGA 
ACACCATCGA 
AATTCAGGGT 
TCTCTTATCA 
GGGAAAAAGT 
AACTGGCGGG 
CGCCGTCGCA 
TGGTGTCGAT 
TCGCGCAACG 
CTGTGGAAGC 
CCATCAACAG 
TCGCATTGGG 
GTCTGCGTCT 
AACGGGAAGG 
AGGGCATCGT 
GCGCCATTAC 
ATACCGAAGA 
TGCTGGGGCA 
GCAATCAGCT 
AAACCGCCTC 
GACTGGAAAG 
GCGGCCA 
CTGCTGGTTT 
AG( 

TCCAGCATGA 

aagcccaacc 
gtcgcAggt 
gatagaagc 



CCTCAGGC 



GGCTACACTA 
AAAAGAGTTG 
GTTTGCAAGC 
TCTACGGGGT 
TTATCGTCGA 
ATGGTGCAAA 
GGTGAATGTG 
GACCGTTTCC 
GGAAGCGGCG 
CAAACAGTCG 
AATTGTCGCG 
GGTAGAACGA 
CGTCAGTGGG 
TGCCTGCACT 
TATTATTTTC 
TCACCAGCAA 
GGCTGGCTGG 
CGACTGGAGT 
TCCCACTGCG 
CGAGTCCGGG 
CAGCTCATGT 
AACCAGCGTG 
GTTGCCCGTC 
TCCCCGCGCG 
CGGGCAGTGA 
TCGTGCCTCC 
CCTGGATGCC 
AGCAGCTGAA 
GATCCCCGCG 
TTTCATAGAA 
CGGTCATTTC 
GATGCGCTGC 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 




lU 

□ 



-58- 

GAATCC^GAG CGGCGATACC GTAAAGCACG AGGAAGCGGT 
TCTTCAGC^ TATCACGGGT AGCCAACGCT ' ATGTCCTGAT 
CGGCCACAG'iv CGATGAATCC AGAAAAGCGG CCATTTTCCA 
GCATCGCCAT (SGGTCACGAC GAGATCCTCG CCGTCGGGCA 
AACAGTTCGG CTGGCGCGAG CCCCTGATGC TCTTCGTCCA 
CCGGCTTCCA TCCGAGTACG TGCTCGCTCG ATGCGATGTT 
CAGGTAGCCG GATCAAGCGT ATGCAGCCGC CGCATTGCAT 
TCGGCAGGAG CAAGG'TGAGA TGACAGGAGA TCCTGCCCCG 
CAGTCCCTTC CCGCTTCAGT GACAACGTCG AGCACAGCTG 
GCCAGCCACG ATAGCCGCffiC TGCCTCGTCC TGCAGTTCAT 
GTCTTGACAA AAAGAACCGG GCGCCCCTGC GCTGACAGCC 
CAGCCGATTG TCTGTTGTGc\cCAGTCATAG CCGAATAGCC 
GAACCTGCGT GCAATCCATC TXlGXTCAATC ATGCGAAACG 
TCAGATCTTG ATCCCCTGCG CC^TCAGATC CTTGGCGGCA 




GCCCCAGCTG 
TGTAAGCC 
GCCCAGT 
CCGCT 
CAA 
AGCGGATAAC 



CAGCCGATTG 
AGCGGTCCGC 
CCATGATATT 
TGCGCGCCTT 
GATCATCCTG 
TCGCTTGGTG 
CAGCCATGAT 
GCACTTCGCC 
CGCAAGGAAC 
TCAGGGCACC 
GGAACACGGC 
TCTCCACCCA 
ATCCTCATCC 
AGAAAGCCAT 
GCAATTCCGG 
CACTGCAAGC 
AGCTGACATT 
TCCTTTAGCA 
AAT^TAGTTT 
AATTTCACAC 



GCCGCCAAGC 
CACACCCAGC 
CGGCAAGCAG 
GAGCCTGGCG 
ATCGACAAGA 
GTCGAATGGG 
GGATACTTTC 
CAATAGCAGC 
GCCCGTCGTG 
GGACAGGTCG 
GGCATCAGAG 
AGCGGCCGGA 
TGTCTCTTGA 
CCAGTTTACT 
TTCGCTTGCT 
TACCTGCTTT 
CATCCGGGGT 
GCCCTTGCGC 
GACTTGTGAG 
ATTAAAGAGG 



TTGCAGGGCT TCCCAACCTT AGO 
GTCCATAAAA CCGCCCAGTC TAGC' 
CTCTTTGCGC TTGCGTTTTC CCTTG 
CAGCACCGTT TCTGCGGACT GGCTTT 
CCTGAGTGCT TGCGGCAGCG TGAAG 
CGGAT7VACAA TTAAGATGTA CCC 
AGAAATTACA TATG 
(2) INFORMATION FOR SEQ I 



(i) SEQUENCE CHARACTE 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:li; 
AAGCTTAAAA AACTGCAAAA AATAGTTTGA CTTGTGAGCG GA^AACAATT AAGATGTACC 
CAATTGTGAG CGGATAACAA TTTCACACAT TAAAGAGGAG AAAOTACATA TG 



2820 

2880 

2940 

3000 ■ 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

3974 



60 
112 



